ECESS Inter-Module Interface Specification for Speech Synthesis

Authors

Javier Pérez

Antonio Bonafonte

Horst-Udo Hain

Eric Keller

Stefan Breuer

Jilei Tian

Abstract

The newly founded European Centre of Excellence for Speech Synthesis (ECESS) (ECESS, 2004) is an initiative to promote the development of the European Research Area (ERA) in the field of Language Technology. ECESS focuses on the great challenge of high-quality speech synthesis, which is of crucial importance for future spoken-language technologies. The main goals of ECESS are to achieve the critical mass needed to substantially advance TTS technology, to integrate basic research know-how related to speech synthesis, and to attract public and private funding. To this end, a common system architecture based on exchangeable modules supplied by the ECESS members is to be established. The XML-based interface that connects these modules is the topic of this paper.

Similar resources

ECESS Platform for Web Based TTS Modules and Systems Evaluation

The paper presents a platform for web-based evaluation of TTS modules and systems, named RES (Remote Evaluation System). It is being developed within the European Centre of Excellence for Speech Synthesis (ECESS, www.ecess.eu). The presented platform will be used for web-based online evaluation of various text-to-speech (TTS) modules, and even complete TTS systems, presently running at different Inst...

Evaluation of Modules and Tools for Speech Synthesis: the ECESS Framework

The ECESS consortium (European Center of Excellence for Speech Synthesis) has set up a framework for the evaluation of software modules and tools relevant to speech synthesis. So far, two lines of evaluation campaigns have been established: evaluation of the ECESS TTS modules (text processing, prosody, acoustic synthesis) and evaluation of ECESS tools (pitch extraction, voice activity detection, phon...

Extensible infrastructure for a 3D face and vocal-tract model

We describe an architecture for a combined 3D face and vocal tract animation simulator for articulatory speech synthesis. The architecture provides five main modules: (1) a simulator engine, (2) a 3D geometry module, (3) a graphical user interface (GUI) module, (4) a synthesis engine, and (5) a numerics engine. Elements of the model are specified using nodes placed hierarchically in a scene graph. Trav...

Visual Specification of Interprocess and Intraprocess Communication

We present a visual specification language for constructing distributed applications and their direct manipulation graphical user interfaces. Each distributed application consists of a collection of independent modules and a configuration of logical connections that define communication among the data interfaces of the modules. Our specification language uses a single visual mechanism that allo...

Language Synthesis Using Image Placement; a Practical Implementation of Language Agnostic, Imagery Based Speech Synthesis

We show the applicability of a language-agnostic interface in the educational field, using NaturalOWL for its language generation. Visual information is extracted from the graphical user interface and processed into plain or annotated text that can be synthesized to speech.

Publication date: 2006
